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Abstract 

A distance labeling scheme is an assignment of bit-labels to the vertices of an undirected, unweighted 
graph such that the distance between any pair of vertices can be decoded solely from their labels. We 
propose a series of new labeling schemes within the framework of so-called hub labeling (HL, also known 
as landmark labeling or 2-hop-cover labeling), in which each node u stores its distance to all nodes from 
an appropriately chosen set of hubs S{u) C V. For a queried pair of nodes (u, v), the length of a shortest 
u—u-path passing through a hub node from S(u) D S(v) is then used as an upper bound on the distance 
between u and v. 

We present a hub labeling which allows us to decode exact distances in sparse graphs using labels 
of size sublinear in the number of nodes. For graphs with at most n nodes and average degree A, 
the tradeoff between label bit size L and query decoding time T for our approach is given by L = 
Cl(n log log^ T/log^ T), for any T < n. Our simple approach is thus the first sublinear-space distance 
labeling for sparse graphs that simultaneously admits small decoding time (for constant A, we can achieve 
any T = cj( 1) while maintaining L = o(n)), and it also provides an improvement in terms of label size 
with respect to previous slower approaches. 

By using similar techniques, we then present a 2-additive labeling scheme for general graphs, i.e., 
one in which the decoder provides a 2-additive-approximation of the distance between any pair of nodes. 
We achieve almost the same label size-time tradeoff L = Cl(n log^ log T/log T), for any T < n. To our 
knowledge, this is the first additive scheme with constant absolute error to use labels of sublinear size. 
The corresponding decoding time is then small (any T = w(l) is sufficient). 

We believe all of our techniques are of independent value and provide a desirable simplification of 
previous approaches. 



1 Introduction 


Distance labeling schemes, popularized by Gavoille et al. [^, are among the most fundamental distributed 
data structures for graph data. The design problem combines two major challenges. First of all, distance la¬ 
belings serve the role of a distance oracle, i.e., a data structure which for a given undirected graph G = (V, E) 
can answer queries of the form: ‘Vhat is the distance between the nodes s,t G VT\ Throughout most of 
this paper, we will assume that G is an unweighted graph with n nodes and m edges. The efficiency of a 
distance oracle is measured by the interplay between the space requirement of the data structure represen¬ 
tation, the encoding time required to set up the oracle for a given graph, and perhaps more importantly, its 
decoding time, that is, the time of processing a, s — t distance query. Moreover, a distance labeling scheme 
is defined more restrictively than a distance oracle, as an assignment of a binary string (label) label(M) to 
each node u G V, &o that the graph distance between u and v is uniquely determined by the pair of labels: 
label(M) and label(n). The size of a distance labeling scheme is now the maximum length of a node label in 
the graph. In this way, distance labelings add an extra layer of complexity to the graph distance decoding 
problem, by imposing a distributed representation of information in the labels (label(M) : u G V). Whereas 
the concatenation of all n labels in a distance labeling forms a centralized distance oracle, distance labelings 
can also be applied in a distributed setting, in which the label of each node is stored at a distinct location 
in the network. This is the case, for instance, in applications in compact routing protocols, where the goal 
is to find a shortest path from a source node to a target node with a known label . 

An interesting characteristic of the problem of distance oracle design for sparse graph is its inherent link 
to an underlying set intersection task. On the side of lower bounds, this is most clearly observed, following 
Patra§cu and Roditty |21| , when we consider a pair of vertices belonging to the same partition of a bipartite 
graph. The distance between them is 2 if and only if the sets of their neighbors intersect, and at least 4 
otherwise. Consequently, assuming a plausible conjecture on the space required to decide intersection of a 
set of small sets, it follows that any oracle for graphs with 0{1) maximum degree, which admits constant 
decoding time, requires D(n^) space. (Here, the O and D notation disregards polylogarithmic factors in 
n.) By contrast, many efficient algorithms for answering distance queries in real-world scenarios rely on the 
premise that the distance between a pair of nodes can be computed using an intersection-type query on a 
pair of small sets. In the basic framework of hub labelings, see [^, (introduced in under the name of 
2-hop covers, and also referred to as landmark labelings [^), each node u G U stores the set of its distances 
to some subset S{u) C V of other nodes of the graph. Then, the computed distance value S'{u,v) for a 
queried pair of nodes u,v G V is returned as: 

S'(u,v) := min S(u,w) + 6(w,v), (1) 

w^S{u)nS{v) 

where 6 denotes the shortest path distance function between a pair of nodes. The computed distance between 
all pairs of nodes u and v is exact if set S{u) H S{v) contains at least one node on some shortest u — v path. 
This property of the family of sets {S(u) : u G V) is known as shortest path eover. The hub-based method of 
distance computation is in practice effective for two reasons. First of all, for transportation-type networks 
it is possible to show bounds on the sizes of sets S, which follow from the network structure. Notably, 
Abraham et al. introduce the notion of highway dimension h oi a network, which is presumed to be a 
small constant e.g. for road networks, and show that an appropriate cover of all shortest paths in the graph 
can be achieved using sets S of size 0{h). Moreover, the order in which elements of sets S{u) and S{v) is 
browsed when performing the minimum operation is relevant, and in some schemes, the operation can be 
interrupted once it is certain that the minimum has been found, before probing all elements of the set. This 
is the principle of numerous heuristics for the exact shortest-path problem, such as contraction hierarchies 
and algorithms with arc flags [9 

In this work, we make use of the hub set techniques to obtain better (distributed) distance labelings. 
Whereas D,{n) is a lower bound of the size of a hub set for general graphs, we provide hub-based schemes 
using smaller sets for specific case, leading to labels which can be encoded on o(n) bits. Our scheme provides 
a shortest-path cover in the class of sparse graphs (with average degree A = 2m/n subpolynomial in n). 
This construction is overviewed in more detail in Section 11.21 
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The implications of our result can be seen as twofold. First of all, our approach directly leads to labeling of 
smaller size (and smaller decoding time) for exact distance queries in sparse graphs than all previous distance 
labeling approaches. Additionally, an corollary of our result concerns the case of k-additive approximate 
distance labeling, in which the distance decoder is required to return an upper bound on the shortest path 
length which is within an additive factor of at most k from the optimum. So far, no way to construct k- 
additive distance labels using labels of sublinear size in n was known for any constant k > 0. (This question 
was considered previously in, e.g., Ill)- We provide a way to construct a 2-additive distance labeling in 
general graphs using distance labels of size o{n). (This result is essentially the best possible, since a 1-additive 
distance labeling requires distance labels of size at least n/4 already on the class of bipartite graphs.) 

In our approaches, the size of the obtained distance labels for the considered cases is improved with 
respect to the state-of-the-art by up to a logarithmic multiplicative factor. Rather than seeing this as 
“gaining” a logarithm, we rather see this as “not losing” a logarithm. Indeed, the basic ingredient of the hub 
sets in previous approaches was a subset of nodes, sampled independently at random from V [^[^. The 
constructions then relied on the probabilistic method to guarantee that the hubs would have the shortest- 
path cover properties, based on the premise that for each pair of nodes the constructed hubs provide a 
shortest path cover with sufficiently high probability. The derandomization of this process resulted in a loss 
of a logarithmic factor in the analysis of the size of labels. Our approach shows how to avoid this issue: 
when constructing labelings for sparse graphs, we do away with randomization altogether, relying on simple 
structural results to replace the random subset of nodes. 


1.1 Related Work 


Distance Labelings. The distance labeling problem in undirected graphs was first investigated by Graham 
and Poliak [^, who provided the first labeling scheme with labels of size 0{n). The decoding time for labels 
of size 0{n) was subsequently improved to O(log log n) by Gavoille et al. and to 0(log* n) by Weimann 
and Peleg [^. Finally, Alstrup et al. present a scheme for general graphs with decoding in 0(1) time 
using labels of size -I- o{n) bits.* This matches up to low order terms the space of the currently best 

known distance oracle with 0{1) time and -I- o(n^) total space in a centralized memory model, due 

to Nitto and Venturini [l^ . 

The notion of D-preserving distance labeling, first introduced by Bollobas et al. 10 , describes a labeling 
scheme correctly encoding every distance that is at least D ^IO] presents such a Z?-preserving scheme of size 
0{^ log^ n). This was recently improved by Alstrup et al. lorto a D-preserving scheme of size 0( log^ D). 
Together with an observation that all distances smaller than D can be stored directly, this results in a 


labeling scheme of size 0{^ log R), where R = 


For specific classes of graphs, Gavoille et al. 


log n 


. For sparse graphs, this is o(n). 


16 described a 0{^/n\ogn) distance labeling for planar 


graphs, together with fl(n^/^) lower bound for the same class of graphs. Additionally, 0{log^ n) upper bound 
for trees and ^{y/n) lower bound for sparse graphs were given. 


Distance Labeling with Hub Sets. For a given graph G, the computational task of minimizing the 
sizes of hub sets {S{u) : u G V) for exact distance decoding is relatively well understood. A G(logn)- 
approximation algorithm for minimizing the average size of a hub set having the sought shortest path cover 
property was presented in Gohen et al. |13| , whereas a (!I(logn)-approximation for minimizing the largest 
hub set at a node was given more recently in Babenko et al. [^. Rather surprisingly, the structural question 
of obtaining bounds on the size of such hub sets for specific graph classes is wide open. For example, for the 
class of graphs of constant maximum degree, there is a large gap between the hub sets in our construction 
(of size 0{n/ logn)) and the generic lower bound of Q.{^/n). 


Distance Oracles. A centralized version of distance labeling problem is distance oracle problem, where 
one asks for a centralized data structure allowing for querying a distance between pair of vertices. There 

*For the sake of sanity of the notation, we define logic = max(l, log2(ic)). 
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usually one asks for what type of tradeoffs are possible between size of the structure, time of the query and 



provided in [2^ . 

1.2 Our Results and Organization of the Paper 

We start by introducing the necessary conventions in Section We also describe the basic building block 
for encoding distance labels, namely, an efficient method of storing the hub set of a node, together with 
corresponding distances, in its distance label. 

In Section we show how to construct an exact distance labeling scheme for graphs of bounded maximum 
degree. This relies on hub sets which consist, for a vertex u of the union of all nodes from a small ball around 
vertex u, and all nodes from a selection of equally-spaced levels of the breadth-first-search tree of u. We then 
apply a trick, known from the previous work of |^, to reduce the problem of constructing a labeling scheme 
for a graph with bounded average degree to that of constructing a labeling scheme for a bounded-degree 
graph on twice as many nodes. For graphs with at most n nodes and average degree A, the tradeoff between 
label bit size L and query decoding time T for our approach is given by L = 0(nlog log^ T/log^ T), for 
any T < n. In particular, setting T = n, we obtain labels of size 0(^logi?), which improves previously 
best result by a factor of logi?, keeping the 0{n) decoding time. On the other end, setting T = logn we 
obtain first sublinear size distance labeling that achieves almost-constant decoding time. 

In Section]^ we adapt our approach to general graphs, using a variant of the proposed labeling scheme for 
sparse graphs to achieve 2-additive approximation of distances. As before, we achieve a tradeoff between label 
size L and time T of the form L — C>(n log^ logT/logT), for any T < n. This 2-additive distance labeling 
scheme can be easily transformed into an exact one, by encoding the difference between the estimation and 
the true distances. Since this difference is always from {0,1,2}, we achieve labels of size -|- o(n) (with 
any a;(l) decoding time), or of size -|- e)n (with 0(1) decoding time), for any e > 0. Our approach 

almost matches the size of the best known distance labeling schemes [^, which make use of labels of size 
-|- o(n) to achieve 0(1) decoding time. Arguably, our approach may be considered simpler. 

We remark that all our results apply to unweighted graphs, in which each edge has unit length. For sparse 
graphs, in which each edge has an integer weight from some interval [1, W], we can use the same hub sets with 
an appropriately modified encoding to achieve a time-label tradeoff of L = 0(n loglog^ Flog IF/log^ T). 
For the additive scheme, by subdividing each edge of length w G [1, IF] into a chain of unweighted edges (of 
length 1), we achieve a conversion of the 2-additive distance labeling scheme into a (2IF)-additive-distance 
scheme for weighted graphs. 

2 Preliminaries 

Notation and Conventions. Even though we are mainly interested in unweighted graphs, for technical 
reasons in Sections]^ andwe will work in a more general setting where every edge of a graph has a fixed cost 
from the set {0,1}. 6{u, v) denotes the cost of a cheapest path connecting a pair of nodes u and v, and £(«,, v) 
denotes the smallest number of edges on such a path. We will require the constructed distance labeling to 
return the value of 6{u,v). The degree of a node v is denoted by deg(u). When analyzing the complexity 
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of the decoding, we assume standard word RAM with logarithmic word size, where we are allowed to access 
log n consecutive bits of the stored binary string in constant time. 

From now on, we assume that the graph is connected. This is enough because we can always include the 
identifier of its connected component in the label of every node, and return cx) if u and v belong to different 
connected components; this only induces additive logn overhead to the label size. 


Encoding Distances and Identifiers. The basic procedure for encoding a hub set in a label exploits 
some ideas from [^; we provide a self-contained exposition for completeness. We fix an arbitrary spanning 
tree of the graph and assign preorder numbers in the tree to the nodes, i.e., node numbered 1 corresponds 
to the root and so on. The preorder number of a node u is denoted by name(it). Such a numbering has the 
following useful property. 

Lemma 2.1. Let ui, U 2 ,..., be the preorder sequence of all nodes. Then, for any node u, X]r =2 
S(u, Ui)| < 2n. 

Proof. Consider an Euler tour corresponding a traversal of the chosen spanning tree. Every node is visited 
at least once there, and the total length of the tour is at most 2n. Consequently, we can cut the tour into 
paths connecting node Vi-i with node Vi, for every z = 2, 3,..., u. The total length of all these paths is at 
most 2n and the claim follows. □ 


The following lemma is used for encoding a hub set S using 0(|S'| log(n/|S'|)) bits. 

Lemma 2.2. For a fixed v and set S such that [S'! < set S and all of the distances 6(y,u) for u G S 
can be stored in 0(y logx) bits. For any constant t > 0, the representation can be augmented with O ( ^ ) 
additional bits so that all elements of S can be extracted one-by-one in 0(|S'|) total time and given any u we 
can check if u € S (and if so, extract 5{u,v)) in 0(1) time. 


Proof. Let S = {vi,... ,U|s|), where name(ui) < name(u 2 ) < ■.. < name(?;| 5 |). We store name(ui) and then 
the differences name(u 2 ) — name(ui),..., name(u| 5 |) — name(u| 5 |_i). Every difference is encoded using the 
Elias 7 code (see Elias [^), and the encodings are concatenated to form one binary string. We are storing up 
to - integers whose absolute values sum up to at most n, so by Jensen’s inequality this takes 0{ - logx) bits 
in total. Similarly, we store 6{u, Ui) and then the differences 6{u, V 2 ) — 5(u, Vi ),..., S(u, U|s|) — S(u, U| 5 |_i). 
By Lemma 2.1 we are again storing up to - numbers whose absolute values sum up to at most 2n, which 
takes O(^loga;) bits. 

All Vi can be extracted one-by-one in 0(1) time each with standard bitwise operations. To facilitate 
checking if x € S' in 0(1) time, we observe that it is enough to store a bit-vector B[l..n\, where the 
name(ui)-th bit is set to 1, for every i = 1,2,..., |S|. Then checking if x € S reduces to two ranki queries. 
A ranki query counts Is in the specified prefix of the bit-vector and a selecti query returns the position of 
the fc-th 1 in the bit-vector. By the result of Patra§cu j^, for any constant t > 0, a bit-vector of length n 
containing - Is can be stored using 


log 


0(^)=0(-logx) + 0(^) 
log n X log n 


bits so that any rank or select query can be answered in Oft) time. This allows us to check if it € S and 
calculate i such that u = Vi'in 0{t) time. To retrieve 5{u,Vi), we store two additional bit-vectors and 
B-. Each of them contains exactly Is and up to 2n Os. The bit-vectors are defined as follows. For each 
1 = 2, 3,..., n we consider the difference 77 = 5{u, Vi) — 5{u, Vi-i). If 17 > 0, we append O’*! to and 1 to 

B-. Otherwise, we append 1 to B+ and 0“’*! to B-. By Lemma 2.1 each of these two bit-vectors contains 
at most 2n Os, so they can be stored using 0{^ logx -I- ^ ) bits so that any rank or select query can be 

answered in Oft) time. To recover S(u,Vi), we need to sum up all the differences. This reduces to summing 
up all positive and all negative differences separately, which can be done using the corresponding bit-vector 
with one ranki and one selecto query in 0(t) total time. □ 
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Figure 1: Shortest path from u to v goes through w which belongs to both By{R') and (offset (u)). 


We remark that the above encoding and decoding scheme is efficient for sets of size IS"! = 0{n). For 
smaller sets, we will simply nse an explicit encoding of all distances in S, requiring 0(|5'| logn) bits. 


3 Exact Distance Labeling in Sparse Graphs 


3.1 Graphs of Bounded Maximum Degree 

In this subsection, we assume that deg(u) < A for every node u. We consider distance labeling schemes 
characterized by a time parameter T. Intuitively, in the construction, R = will be a threshold parameter, 
distingnishing small distances from large distances in the graph — a node will be able to afford to explicitly 
store the distances and identifiers of all nodes up to some distance 0(R) from itself in its distance label. 
Althongh this case is of independent interest, we are considering it as a building block for construction of 
labeling in graphs of bounded average degree. Thns graphs considered here are weighted with edge weights 


from {0,1}, for the reason explained in Section 3.2 


The rest of this subsection is devoted to the proof of the following Theorem. 


Theorem 3.1. Fix any value A < T < n and let R = log^ T. In bounded-degree graphs, there is a labeling 
scheme of size 0{^logR) and decoding time 0{T). 

Let ns denote R' = [i?J. Since R' > 1, we can bound R > R' > \R. Consider a node u. The ball of 
radius r centered at u, denoted Bu{r), is the set of nodes which can be reached from u by following at most 
r edges. Because the degrees of all nodes are bounded by A, |i?„(r)| = 0{A^). The fc-th layer centered at u, 
denoted Lu{k), consists of all nodes v such that £{u,v) = k (mod R)'. Because the layers are disjoint, there 
exists an offset(u) G {0,1,..., R' — 1} snch that |L„(offset(u))| < 


Definition of the Labeling. We define the hub set of node u, to which it stores all its distances, as 
S{u) := Bu{R') U L„(offset(it)), see Fig.[^ Formally, the label of u consists of the following: 

1. n and name(M), 

2 . name(n) and S{u,v) for every v G Bu{R'), 

3. name('(;) and S{u,v) for every v G (offset (u)). 
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Computing S{u,v). For reasons of efficiency, we will not perform the distance decoding following Eq. Q 
directly, but we will treat the two components of the hub set of each node separately. Given name(M) and 
name(?;), we can determine S{u,v) as follows. First we check if r; S Bu{R') and if so return the stored 
S{u,v). Otherwise, we iterate through all nodes w G Bu{R') and check if w S L^(offset(u)). If so, we know 
S(u, w) + 6{w, v). We return the smallest such sum. 

For the proof of correctness of the distance decoder, it is clear that S{u, w) + S{w, v) > S{u, v) for any w, 
so it remains to argue that either v G Bu{R') or there exists w G Bu{R') such that w G Ly(offset(v)) and 
S(u,w) + S(iu,v) = S(u,v). Consider a shortest path (po,Pi,P 2 , ■ ■ -Pe) where v = po and u = pi such that 
£ = £{u,v). If £ < R!, V G Bu{R') and there is nothing to prove, so we can assume that i > R!. Observe 
that for any i = 0,1,... ,£, £(v,Pi) = i, so in particular Pa-R'+offset{v) G L^(ofFset(u)) for any integer a > 0. 


We choose a = ^ offset(«) j ^ Then w G L„(ofFset(u)), w G Bu{R') by the choice of a, 

and 6{u,w) + S{w,v) = d{u,v) because w lies on a shortest path connecting u and v, so indeed we are able 
to correctly determine S{u,v). 


Encoding and Size of the Scheme. Encoding n and name(M) takes O(logn) bits. The set Bu{R') with 
corresponding distance s is s tored explicitly, while set (offset (u)) together with the corresponding distances 

using 0{A^ logn) = C>(Tlogn) and C>(^logi?') bits, respectively. Hence the 

Ti TL 

C>(logn + Tlogn+ — logi?') = C>(-logi?), 

H K 


2.2 


is stored using Lemma 
total size of the scheme is 


where we have used the fact that for any T = poly(n) the claimed label size is the same, thus we can assume 
T = o(n/polylog(n)). 


Complexity of the Decoding. Checking if u G Bu{R') and retrieving the encoded S{u,v) takes 0{T) 
time. Similarly, iterating through all w G Bu{R'), checking if ui G L^(ofFset(u)) and if so retrieving the 
encoded S{v,w) takes, by Lemma [2.2| 0{1) time per single w, thus 0(|i3„(i?')|) = 0{T) total time. All in 
all, we can compute 5{u,v) in 0(T) total time. □ 


Smaller values of T. For the sake of completeness, we consider the special case of T < A. Consider 
labeling where the label of a node u consists of n, name(u), and all values 6{u,v) for v G V stored using 


Lemma 2.2 This takes 0{n) bits, with 0(1) decoding time, and matches claimed bounds from Theorem 3.1 


We also observe that our result applies not only to distance labels, but also as a size upper bound of hub 
sets for sparse graphs. Indeed, by fixing T = n, and observing that |i?„(i?')| + |L„(ofFset(u))| < j^, we have 
the following: 

Corollary 3.2. In bounded-degree graphs, there is a hub set construction of size vertices per node. 


3.2 Graphs of Bounded Average Degree 


We now allow for bounded average degree by reduction to the approach from Subsection |3.1| Given a graph 
G, let A = . We will create a new graph by splitting nodes of high degree. Following the formulation 

from Lemma 4.2] (cf. Figure]^, we can obtain a graph G' on at most 2n nodes and at most m-\-n edges, 
such that the degree of every node is bounded by + 2 < A + 2 and the distance between two nodes in 
the original graph G is exactly the same as the distance between their corresponding nodes in the new graph 


G'. We can now directly apply the scheme from Theorem 3.1 to graph G', and exactly the same distance 
labels will work for the corresponding nodes of graph G. In this way, we obtain a scheme of size: 

G(| logi?+-)=G(| logi?), where i? = G(|°^). 

R n R log A 

which returns 5{u,v) in 0{T) time given the labels of u and v. The correctness of the this reduction is 
guaranteed by the fact that Theorem 3.1 allows for edge weights from {0,1}. 
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Figure 2: Example of subdividing of a large degree node (on the left) into a family of nodes of small degree, 
connected by edges of weight 0 (dashed edges). 


Theorem 3.3. Fix any T <n and A, and let R — 1°^ ^ . There exists an exact distance labeling for graphs 
with average degree A using labels of size 0{^logR) and a corresponding decoding scheme requiring time 
0{T). □ 

It is easy to see that this reduction preserves bounds on the size of hub sets, so we have the following: 

Corollary 3.4. In graphs with average degree A, there is a hub set construction of size ^ ) vertices 

per node. 


4 2-Additive Distance Labeling in General Graphs 


We will apply a similar distance labeling scheme as for sparse graphs, obtaining a 2-additive approximation 
of the distance between any pair of with label sizes of o(n) per node. In this approximate scheme, the hub 
sets will have the following property. The label of each node u € V will provide an encoding of the node 
identifiers of a subset S{u) C V and of the distances from u to all elements of S{u). The sets S{u) will be 
defined so that for any pair u, v, there exists a node w € S(u) H S{v), such that either w or a neighbor of w 
lies on the shortest path from u to u in G. We will decode the approximate distance as before, using Eq. Q; 
clearly, 6'{u, v) € 6{u, v) + { 0 , 1 , 2 }. 

The construction of sets S{u) is performed as follows. Let t | logn be an threshold value of vertex 
degree, to be chosen later. Let V = {v & V : deg(u) > r}, and let S" C V be a minimal dominating set 
for V', i.e., a subset of V with the property: Vuj£V'Bw{\) H S" ^ 0. By a straightforward application of the 
probabilistic method (cf. proof of Theorem 1.2.2]), we have that there is S' such that |S"| < ^ 

^ n, and it can be easily constructed in polynomial time (a deterministic construction by a folklore greedy 
algorithm gives set of size 0(—n)). For every u G V, we define B'^{r) as the set of nodes of the ball of 
radius r around u in the subgraph G\V \ V']. Finally, we define R = and let Lu, R', and offset(u) be 
defined as in Section 3.1 and let be a minimal subset of S' such that for every w G V adjacent the 
boundary of B'^R'), i.e. 5^,(1) n 5(^(1?') 7 ^ 0, we have Buj(l) n yf 0. Such S!^^ can be easily constructed 
in polynomial time, and moreover, since there are at most vertices adjacent to the boundary, we have 

|5;| = 20 M. 

The approximate distance label of u now consists of the following elements: 


1. n and name(M), 

2 . name(u) and S{u,v) 

3. name('(;) and S{u,v) 

4. name(u) and S{u,v) 

5. name(u) and d{u,v) 
The separation of S' into 


for every v G B'^R'), 
for every v G (offset (u)). 

for every v G S'^, 
for every v G S' \ S'.^. 

5^ and S' \ S'^ in the label is done to allow efficient decoding. 






Computing S{u,v). To show the correctness of this approximate labeling scheme, fix a pair of vertices 
u,v G V. If there exists a vertex w G B'^{R') lying on a fixed shortest path P between u and v such that 
w = V or w G L„(offset(r')), then the labeling scheme finds the shortest path distance between u and v as in 
Otherwise, let y be the nearest vertex to u lying in P \ it follows from the construction 


Section 3.1 


that y G V . Then, there exists w G By{l) such that w G S'^ C S' . In this case, the distance i5(m, w) +S{v, w) 
is a 2-additive approximation of (5 (m, v). 


Size of the Scheme. The size of the label of a node u in the scheme can be bounded as follows: \B'^{R')\ < 
^ 2'^^ |L„(offset(u))| < ^, S' < Overall, the total size is |i?(,(i?')ULu(ofFset(u))US''| = 

thus using Lemma 2.2 to store the sets and the corresponding distances we obtain labels of size ''' ). 


Complexity of the Decoding. To perform the distance decoding, for a given pair u,v G V^ it suffices 
to minimize 5{u, w) 5{v, w) over all w belonging to B'^(R') U S'^ which are also encoded in the label of v. 
Hence, distance decoding is possible in time 2®*^"^). Overall, setting T := we obtain the following main 

result of the section. 

Theorem 4.1. There is a 2-additive distance labeling scheme for general graphs, which achieves decoding 
time T using labels of size for any T <n. □ 

Finally, we remark on some implications of our result. By a standard argument, converting a 2-additive 
approximate distance labeling into an exact one requires an additional label of size ^ n bits per node (and 
an additional overhead in the space, which is negligible), with each node u encoding the difference 

between the approximate and real distance value, S'{u,v) — S{u,v), for all v G {(u -I- 1) mod n,..., (rt -f 

) mod n}. The time overhead of the corresponding decoding is 0(1). In an analogous manner, converting 
a 2 -additive approximate distance labeling into an 1 -additive approximate one requires an additional label 
of size bits per node. Thus we convert our scheme into an exact distance labeling scheme or 1-additive 

scheme achieving T decoding time using labels of size respectively ^ n-|-0(n ^ ) or ln-|-0(n *°^^g‘^^ ), 

for any T < n. 

Thus, setting r as an arbitrarily small increasing function of n, for any desired decoding time T = a;(l) 
we can make use of labels of size o(n), \n -\- o{n) and ^ n + o{n) respectively for 2 -additive, 1 -additive 
and exact distances. Moreover, using this scheme, 0(1) decoding time can be achieved for labels of size en, 
-G e) ■ n and ( ^ + s) ■ n, for any absolute constant e > 0 . 

While a slightly stronger in terms of decoding time schemes were presented in Alstrup et al. (achieving 
0 ( 1 ) decoding time and labels of size ^ n + oin) and -f o{n) for exact and 1 -additive distances), we 
believe that presented here schemes are of independent value due to the simplification of the construction. 
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